II. AMENDMENTS TO THE CLAIMS 

The following listing of claims replaces all prior versions, and listings, of claims in the 
application: 

1 . (Currently amended) A computer implemented method of producing a linguistic 
dictionary, the method comprising: 

obtaining orthographic variations of dictionary words; 

explicitly storing substantially all orthographic variations of words in a finite state 
transducer database, and 

generating, for each of the orthographic variations, a cut and paste code, 

wherein the cut and paste code includes code which indicates how many 
characters should be cut from the end of a surface form of a word and pasted to produce a 
particular variation, 

wherein the cut and paste code is extended by a gloss code that indicates whether 
at least part of the orthographic variation should be converted between upper and lower 
case; and 

storing, for each of the orthographic variations, the associated extended cut and paste 

code. 

2. (Original) The method of claim 1, wherein the gloss code further indicates whether 
conversion should be performed between each single and double character sequence in the 
orthographic variation. 
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3. (Original) The method of claim 1, wherein the gloss code indicates one of (i)-(vii): 

(i) Do nothing; 

(ii) Convert first character to upper case; 

(iii) Convert first character to lower case; 

(iv) Convert word to lower case; 

(v) Convert word to upper case; 

(vi) Convert word to upper case and replace each single character sequence with 
equivalent double character sequence; and 

(vii) Convert word to lower case and replace each double character sequence with single 
characters. 

4. (Original) The method of claim 1, further comprising: 

storing, for each word having an accented character: 

a word having a composite form of the accented character; and 

a word having an expanded form of the accented character that includes a base character 
and an accent character. 

5. (Currently amended) A computer implemented linguistic dictionary comprising: 

a computer, having: 

a finite state transducer database for explicitly storing substantially all 
orthographic variations of words; and 

a means of generating and storin g, for each of t he generated orthographic 
variations,, comprising : a cut and paste code, 
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wherein the cut and paste code includes code which indicates how many 
characters should be cut from the end of a surface form of a word and pasted to 
produce a particular variation, 

wherein the cut and paste code is extended by a gloss code that indicates 
whether at least part of the orthographic variation should be converted between 
upper and lower case,, 
wherein the computer further comprises: a data carrier including one of a magnetic 
computer disk and an optical computer disk, and a processor. 

6. (Original) The linguistic dictionary of claim 5, wherein the extended gloss code further 
indicates whether conversion should be performed between each single and double character 
sequence in the orthographic variation. 

7. (Original) The linguistic dictionary of claim 5, wherein the extended gloss code indicates 
one of (i)-(vii): 

(i) Do nothing; 

(ii) Convert first character to upper case; 

(iii) Convert first character to lower case; 

(iv) Convert word to lower case; 

(v) Convert word to upper case; 

(vi) Convert word to upper case and replace each single character sequence with 
equivalent double character sequence; and 

(vii) Convert word to lower case and replace each double character sequence with single 
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characters. 



8. (Original) The linguistic dictionary of claim 5, wherein the database stores, for each word 
having an accented character: 

a word having a composite form of the accented character; and 

a word having an expanded form of the accented character that includes a base character 
and an accent character. 

9. (Currently amended) A computer program product encoded on a computer readable 
storage medium comprising computer program means executable by a computer for performing 
substantially the steps of: 

obtaining orthographic variations of dictionary words; 

explicitly storing substantially all orthographic variations of words in a finite state 
transducer database, and 

generating, for each of the orthographic variations, a cut and paste code, 

wherein the cut and paste code includes code which indicates how many 
characters should be cut from the end of a surface form of a word and pasted to produce a 
particular variation, 

wherein the cut and paste code is extended by a gloss code that indicates whether 
at least part of the orthographic variation should be converted between upper and lower 
case; and 

storing, for each of the orthographic variations, the associated extended cut and paste 

code. 
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10. (Original) The computer program product of claim 9, wherein the extended gloss code 
further indicates whether conversion should be performed between each single and double 
character sequence in the orthographic variation. 

1 1 . (Original) The computer program product of claim 9, wherein the extended gloss code 
indicates one of (i)-(vii): 

(i) Do nothing; 

(ii) Convert first character to upper case; 

(iii) Convert first character to lower case; 

(iv) Convert word to lower case; 

(v) Convert word to upper case; 

(vi) Convert word to upper case and replace each single character sequence with 
equivalent double character sequence; and 

(vii) Convert word to lower case and replace each double character sequence with single 
characters. 

12. (Original) The computer program product of claim 9, further comprising computer program 
means for storing, for each word having an accented character: 

a word having a composite form of the accented character; and 

a word having an expanded form of the accented character that includes a base character 
and an accent character. 
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13. (Previously Presented) The method of claim 1, wherein the extended code is adapted to 
allow a single orthographic variation to indicate a variation for a plurality of root words. 
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